Bayesian Analysis of Comparative Survey Data
نویسندگان
چکیده
Bayesian hierarchical models provide a useful way of analyzing multilevel survey data. The Bayesian estimates have good statistical properties, make good predictions, and realistically account for clustering in the data. Still the Bayesian estimates can be biased in the presence of omitted variables and fixed effect models might sometimes be preferable. Bayesian statistics for model comparison and evaluation—posterior predictive checks and the Deviance Information Criterion—assist an empirical approach to distinguishing between hierarchical models and their alternatives. These ideas are illustrated with an analysis of migration data from 22 villages in the Nang Rong district of Thailand. Bayesian statistics can make a special contribution to comparative and historical social science. Comparative data are often not generated by a welldefined probability mechanism, so a researcher’s uncertainty may be better described by a degree-of-belief than the frequency behavior of sample statistics (Berk et al. 1994). Comparative and historical researchers also unearth rich qualitative information about particular countries, regions, and historical periods. In a classicial analysis, this nonsample information generally provides informal guides to model choice or the post hoc interpretation of results. Bayesian prior distributions explicitly incorporate non-sample information that often influences data analysis in a more informal way (Western and Jackman 1994). Prior information can have a large effect in comparative analysis because data sets can be small and collinear. Under these conditions, the final results may also depend closely on the choice of models. Bayesian statistics can incorporate uncertainty about the model specification, pushing inference in a more conservative direction (Western 1995). Finally, a key message of comparative social science is that social and political processes vary across countries, regions, and time periods. Bayesian hierarchical models help us analyze these kinds of heterogeneity (Western 1998; Western and Kleykamp 2004). This symposium on the analysis of multilevel survey data provides another opportunity to apply Bayesian methods to the special methodological problems of comparative research. Multilevel survey data are collected from, say, a dozen or more countries, perhaps at several points in time. This data structure shares some features with the pooled time series familiar to comparative researchers—observations are clustered by country and there is likely causal heterogeneity across countries. But unlike other comparative data, multilevel survey data provide enough information about each country to
منابع مشابه
A Bayesian Nominal Regression Model with Random Effects for Analysing Tehran Labor Force Survey Data
Large survey data are often accompanied by sampling weights that reflect the inequality probabilities for selecting samples in complex sampling. Sampling weights act as an expansion factor that, by scaling the subjects, turns the sample into a representative of the community. The quasi-maximum likelihood method is one of the approaches for considering sampling weights in the frequentist framewo...
متن کاملمقایسه روش بیزی (Bayesian) و کلاسیک در برآرد پارامترهای مدل رگرسیون لجستیک با وجود مقادیر گمشده در متغیرهای کمکی
Background and Aim: Logistic regression is an analytic tool widely used in medical and epidemiologic research. In many studies, we face data sets in which some of the data are not recorded. A simple way to deal with such "missing data" is to simply ignore the subjects with missing observations, and perform the analysis on cases for which complete data are available. Materials and Methods: We c...
متن کاملComparative Analysis of Machine Learning Algorithms with Optimization Purposes
The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches. Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data. In this paper, a methodology has been employed to opt...
متن کاملA Bayesian Approach to Estimate Parameters of a Random Coefficient Transition Binary Logistic Model with Non-monotone Missing Pattern and some Sensitivity Analyses
A transition binary logistic model with random coefficients is proposed to model the unemployment statues of household members in two seasons of spring and summer. Data correspond to the labor force survey performed by Statistical Center of Iran in 2006. This model is introduced to take into account two kinds of correlation in the data one due to the longitudinal nature o...
متن کاملBayesian and Iterative Maximum Likelihood Estimation of the Coefficients in Logistic Regression Analysis with Linked Data
This paper considers logistic regression analysis with linked data. It is shown that, in logistic regression analysis with linked data, a finite mixture of Bernoulli distributions can be used for modeling the response variables. We proposed an iterative maximum likelihood estimator for the regression coefficients that takes the matching probabilities into account. Next, the Bayesian counterpart...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005